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ABSTRACT 

The two-point correlation function, £, of Lyman-a forest is found to be large, 
£ = 1.8^2, > 90% confidence level, on the scale of 250-500 km/s for a sample 
of absorbers (0 < z < 1.3) assembled from HST Key Project Observations. This 
correlation function is stronger than at high redshift (z > 1.7) where £ ~ 0.2 for 
velocities > 250 km/s. 



Subject headings: quasars: absorption lines 



1. Introduction 

Many correlation studies of the Lyman-a forest have been made in the past decades. 
Almost all of these investigations were limited to redshifts greater than ~1.7, since they 
were carried out on the ground. After many years of controversial weak detections of the 
two-point correlation function at z > 1.7 on scales < 300 km/s ([Webb 1987] , |Ostriker" 
Bajtlik, fc Duncan 1988| , [Rauch et al. 1992| , (Srianand fc Khare 1994| , |Chernomordik 



|1995| , |Cristiani et al. 1995| , Kulkarni et al. 1996| , [Srianand 1996| ), a consensus may be 
developing that there is a small, but finite correlation, £ « 0.2 on scales less than 300 km/s 
(see pristiani 1995| for a review). In contrast, based on the finding of Cowie et al. (1995), 
which shows that some high-redshift Ly-a lines have many weak associated carbon lines 
described as "cloudlets," Fernadez-Soto et al. (1996) report an extremely strong correlation 
function, £ w 10, among these weak carbon lines on scales ~ 200A;m/s. This finding begs 
the question of what an individual Ly-a system is. At scales larger than 300 km/s, the 
correlation at z > 1.7 is to quite weak or undetectable QSargent et al. 1980| , [Sargent, Young 
Schneider T982j [Bechtold 1987| , |Webb fc Barcons 1991| , |Cristiani 1995| , (Srianand 1996]) . 
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Recently, UV observations with Faint Object Spectrograph of the Hubble Space 
Telescope have opened up a new regime of redshift space for study, z ~ 0-1.3 (Bahcall 
et al. 1991, 1992, 1993a (CAT1), 1993b, 1996 (CAT2); Morris et al. 1991). A reasonably 
strong correlation function among these low-redshift systems might be anticipated because 
(1) gravitational correlations tend to grow with time, and (2) very low redshift observations 
of galaxies near quasar lines of sight which show that perhaps one third or more of the 
Ly-a forest lines are associated with galaxies ([Lanzetta et al. 1995| ; see also |BahcaH 



|et al. 1992j , [Bpinrad et al. 1993| , (Barajedini, Green, fc Jannuzi 19951) which are highly 



correlated objects. No evidence for a two-point correlation function was found in the first 
analysis of the Quasar Absorption Line Key Project data (CAT1), although evidence for 
clumping of the Ly-a forest lines near metal-line systems was discovered (CAT2). Studies 
of high redshift metal-line systems show strong two-point correlation functions (£ ~ 5 at 
100 km/s) on scales of up to ~500 km/s ( |Sargent, Boksenberg, fc Steidel 1988| , [tieisler 



Hogan, fc White 1989| , [Fetitjean fc Bergeron 199~4] ). Therefore, if the low-redshift Ly-a 



forest lines are either gravitationally evolved counterparts of high redshift Ly-a lines, are 
associated with ordinary galaxies, or clump around the strongly clustered metal lines, they 
should have a relatively strong, yet unmeasured, two-point correlation function. 

In this paper, a complete, well-defined sample of low-redshift absorbers is analyzed 
for a two-point correlation function using techniques that are especially appropriate for 
quasar absorption lines. In particular, a new algorithm for simulating data with a two-point 
correlation function allows for a good determination of the errors in the detected correlation 
function. The paper is organized as follows. In §11, the data are described and the procedure 
for creating a complete data set is presented. Using extensive Monte Carlo simulations, 
the two-point correlation function for the Ly-a forest is shown to be non-zero in §111. In 
§IV, the strength of the two-point correlation is determined, again with use of Monte Carlo 
simulations. Blends are shown to have only a minimal effect on the findings in §V. A 
discussion and summary of the results of the paper is given in §VI. 



2. A Complete Data Set 

In order to create a sample free of experimental biases, a subset of Ly-a lines was 
selected from the published catalogs assembled by the Quasar Absorption Line Key Project 
from Faint Object Spectrograph (R=1300) observations on the Hubble Space Telescope 
( |CAT1| , |C AT2| ) . The selection procedure is similar to that described in §VII of |CAT2| . The 
Ly-a sample adopted here has a uniform detection limit of 0.24 A, rest equivalent width, 
which corresponds to ~ 2 x 10 14 Hi/cm 2 for unsaturated lines. Lines from spectral regions 
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in which the line detection limit, Wm^? was greater than 0.24 A are excluded. In other 
words, no lines are included in the sample which are very weak lines or which come from 
parts of the spectrum where the sensitivity was low. Because of the proximity effect, lines 
within 3000 km/s of the quasar emission redshift are excluded. 

The total sample consists of 100 Ly-a lines. Nine of these are associated with extensive 
metal-line systems with four or more detected metal lines (see |CATT| , PAT2| ) . The mean 
redshift of the lines is 0.7. 

The complete sample used for analysis in this paper is given in Table [I] and Table 0. 
The spectral range for each quasar spectrum where the sensitivity was high enough to meet 
the uniform detection limit criteria is given in Table |3|. 



3. Pairs counts and the Two-point Correlation Function 

The two-point correlation function can be estimated from the pair counts in the data 
according to the formula, 

JT(OhQ(r) 

<l ' Af(Ran)(r) ' [ ' 

where iV(Obs) is the number of observed pairs and iV(Ran) is the number of pairs in a given 
redshift bin that would be expected in the absence of clustering. iV(Ran) is determined by 
Monte Carlo simulations. The same velocity bins are used as employed in the first empirical 
analysis of the two-point correlation function in this data (|CAT1|) . 

In the Monte Carlo simulations, the number of pairs expected in the absence of 
clustering was calculated by placing lines along the part of each quasar line-of-sight where 
the minimum observable rest equivalent width was less than the uniform detection limit 
of 0.24 A. The lines are Poisson distributed with the probability of a line being placed at 
a given redshift being proportional to the assumed density at the corresponding redshift. 
Studies of the number density at both high and low redshift have indicated that 

n(z) ^A(l + Z y, (2) 

where n(z) is the number density, A is a normalization constant, and the exponent 7 is 
between and 1.5 in the redshift range of interest (CAT1,CAT2). To mimic the finite 
resolution of the data in the simulations, lines with separations less than 230 km/s were 
excluded following |CAT1| . For each separate data subset investigated, ten thousand 
simulations were performed which was adequate to fix the error bars reliably. The slope, 7, 
of the number density was varied between and 1.5, which covers the range allowed by the 
observational data. 
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The approach taken here differs from some of the previous analyses of clustering (e.g. 
UAT1| ) in two ways. First, the normalization constant, A, used to produce the number of 



lines along each line-of-sight, is determined for the entire sample and is the same for every 
quasar line-of-sight. In previous papers, the Monte Carlo simulations, used to determine 
iV(Ran) matched the number of simulated lines along the line-of-sight with the observed 
number of lines along the line-of-sight. The strength of the correlation function will be 
underestimated if A is determined, in this way, separately, for each line-of-sight. Monte 
Carlo simulations show the strength of this normalization effect in this data sample to be 
about 20% in the sense that the expected number of pairs is decreased by 20% when the 
normalization done correctly. The effect is most important in data samples with a small 
number of lines along each line-of-sight where Poisson fluctuations of the number of lines 
along each line of sight are significant, as is the case for low-redshift studies. Second, the 
evolution of number density evolution is included in the Poisson distribution. If it were 
neglected, the number of expected pairs would be underestimated, and the strength of any 
clustering would be overestimated, because pair counts scale as the square of the density, 



AiV(Ran) = iV(Ran) 



/ n (z)dz 
J n 2 dz 



- 1 



(3) 



where n(z) is the number density as a function of redshift, and n is the unevolving number 
density. In this sample, the strength of this effect is about 20% as well. 

Figures 1 and 2 show the pair counts of Ly-a systems alone and for the combined Ly-ct 
and metal-line system subset. For Figure 1, pair separations are calculated in terms of 
relative velocity as measured by an observer at the absorption systems, and for Figure 2 the 
separations are shown in terms of comoving coordinates. The relations used to calculate 
the coordinates are (Q — 1) 

c Az /A . 
Av = ——, 4 
1 + z 

and 

c Az 

There do not appear to be any statistically significant differences when the data are 
analyzed in terms of comoving distance rather than velocity. The slope of the number 
density affects the expected number of pairs in the absence of clustering, iV(Ran), and 
therefore the strength of any correlation. The largest number of expected pairs is found 
with the largest value of 7 (1.5) and the smallest number of expected pairs with 7 = 0. The 
results of the Monte Carlo simulations for (7 = and 1.5) are also shown in Figures 1 and 2. 
The actual slope of the number density is likely between these values, and correspondingly, 
the true iV(Ran) lies between the values resulting from these Monte Carlo simulations. 
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For both samples and separation measures, there is a strong excess of pairs in the 
first bin corresponding to 250-500 km/s or 2.5-5 /i _1 Mpc. Table |] shows the results of the 
Monte Carlo experiments that were designed to determine the statistical significance of the 
observed correlation. The probability that an unclustered distribution could produce in the 
first bin an excess as high as observed is less than 0.5% in all the cases considered. The 
probabilities reported in Table [| are conservative in the sense that, 7, was taken to be 1.5 
which is as large as allowed by observation at 2a, so that iV(Ran) was a large as possible. 



4. Strength of the Correlation Function 

The allowed range of strength of the two point correlation function is determined by 
Monte Carlo simulations (see Table f|). Because the correlation function is detected in 
only the first bin of the pair counts, the shape of the correlation function was taken as a 
free parameter. As shown below, this freedom does not strongly affect the range of the 
determined strength. 

To test the strength of the observed correlation function by Monte Carlo simulation, 
an algorithm was devised to create a one- dimensional sample with a specified two point 
correlation function. The many 3-dimensional methods for generating correlated sets of 
objects (e.g. [Neyman, fc Scott 1952| , [Soneira fc Peebles 1977|) diverge in 1-dimension. 



Therefore, a different algorithm was developed for the 1-dimensional application in this 
paper. Details of the algorithm and its relation to 3-D methods will be described in detail 
in a future paper ( |U Inter 1996| ); a brief description is supplied in Appendix A. 



Confidence regions were established by determining what the strongest (weakest) 
simulated correlation function for which X% of the simulations had as few (as many) pairs 
in the first bin as the data. Evolution of the number density (Eq. |2|) has an effect on the 
expected number of pairs in the absence of clustering, and therefore on the strength of 
the correlation function (see Figures 1 and 2). Table £| lists the minimum, £ min , and the 
maximum, £ m i n , values of the correlation function evaluated at the 90% confidence limits 
in the 250-500 km/s bin. The value of £ m i n (£ ma x) was established by requiring that less 
than 5functions smaller (larger) that the value found in the actual sample. The confidence 
regions given in Table ^ account for this uncertainty in a conservative fashion. For instance, 
the upper limit of the confidence region, £ max , is found using simulations with the smallest 
allowable number density slope, 7 = 0, which implies the smallest number of expected 
pairs, and therefore the largest correlation function. Then, £ max , is equal to the value of the 
correlation function, evaluated in the 250-500 km/s bin, which has the number of pairs in 
the first bin larger than those observed in 95% of the trials. 
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In principle, the confidence regions will depend on all n-point correlation functions, or 
equivalently, the exact algorithm used to create the simulation. In this particular case where 
the pair counts in only one bin are examined, the 90% confidence regions given in Table |] 
is not expected to be too sensitive to the particular type of correlation function used in the 
simulated data. To test this hypothesis, a variety of two-point and three-point correlation 
functions were simulated and thereby used to construct confidence intervals. The intervals 
derived were found to be consistent with those reported in Table [| to within 5% of their 
values. Specifically, confidence regions were determined from simulated data with two-point 
correlation functions of the form which describes galaxies and clusters ( Peebles 1980| ) 



£( r ) = "I! ( r < r min) 

£(r) = (r/ro)""" 7 ; (r > r min ) 

with r min corresponding to the resolution of the data and the exponent, 7, ranging between 
1.3 and 2.3. A particular type of three-point correlation function which is appropriate for 
galaxies was also added with a strength parameter Q ranging from 0.5-1 ( Peebles 1980 ). 



While the Ly-a forest may not have the same two and three point correlation functions as 
described above, the important point is that the confidence regions of the strength of the 
correlation function are insensitive to the specific shape of correlation function. 

The top panel of Figure 3 illustrates the confidence intervals for one of the cases in 
Table |J For comparison, the galaxy-galaxy correlation function is shown in the bottom 
panel of the figure. The galaxy-galaxy correlation function is within the confidence limits 
determined by Monte Carlo simulations. 



5. Influence of Blends 

In this section, the reality of the close pairs and the possible influence of blends on the 
observed pairs is investigated. Table gives detailed information about each pair in the 
250-500 km/s bin which shows the excess. The average and median equivalent widths of 
the lines in the close pairs are indistinguishable from that of the entire sample. 

In the absence of clustering, the low number density of Ly-a lines at small redshift 
makes blending much less of a problem than at larger redshifts. The resolution of the 
Faint Object Spectrograph is 1300 (CAT1) which corresponds to 230 km/s. Then, the 
number of expected blended lines, that is, lines with separations less than 230 km/s, in 
the entire sample is ~4. If the blended lines were separated by the line selection algorithm 
into pairs of lines with separations greater than 250 km/s, an artificial excess of close 
pairs could be created in the sense that the 250-500 km/s bin would contain all pairs with 
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actual separations between and 500 km/s. This contamination is unlikely to contribute 
strongly to the excess detected pairs, because such artificial pairs should have separations 
just larger than the resolution limit, whereas the close pairs shown in Table [5] do not show 
this property. 

A single line convolved with the instrumental profile and sufficient noise may, in 
principle, be artificially split into a pair by the automated line selection algorithm. The 
efficiency of such splittings can be assessed by Monte Carlo experiments using the detailed 
simulation software developed by the QSO absorption Key Project team flSchneidci 



et al. 19"93| ). D. Schneider (private communication, 1996) generously ran 6000 lines through 



the Key Project line-simulation software which adds noise, accounts for instrumental 
resolution, and implements the same line selection routine that was used on the actual 
Hubble data. The results are very encouraging. Even, if all 100 lines in the sample had a 
huge internal velocity dispersion of 300 km/s (typical velocity dispersions are 15-50 km/s at 
z ~ 2, e.g. Kulkarni et al. 1996j ), and a large rest equivalent width of 0.67 as did the first 
3000 simulated lines (the actual lines are distributed as an exponential distribution with 
scale ~0.32 A), the estimated total number of lines expected to be artificially split into 
lines with separation greater than 250 km/s in the entire data set is only 0.56. The actual 
number of artificial pairs in the data is probably much smaller. First, many of the lines in 
the sample are so weak that even they could not be split into two lines which would exceed 
the uniform detection limit of 0.24A, rest equivalent width. The effective cutoff in rest 
equivalent width for a line to be be able to split into two lines in the sample occurs above 
2 x 0.24 = 0.48A, and is closer to 0.6A, because lines do not often split exactly into equal 
parts. The fraction of lines in the sample with rest equivalent width > 0.6A is ~ 0.3, so 
the total number of expected artificial pairs is reduced to ~0.17. Further, most Ly-a lines 
probably have velocity dispersions much less than 300 km/s, and are therefore less likely to 
split into artificial pairs. We conclude that the effect of blends is no more than 5% on the 
total number of detected pairs and probably substantially less. Even in the pathological 
case that all lines had gigantic velocity dispersions of 500 km/s which was the case for the 
second set of 3000 simulated lines, simulations and the same line of argument shows that 
the total number of artificial pairs is expected to be less than 0.7. This pathological case 
would affect slightly the confidence regions described in the previous section, but would still 
not account for all of the excess close pairs. 

Because the close pairs are real, the Ly-a lines are strongly clustered. This clustering 
will effect the number of blends in the medium resolution data and has important 
implications for further analyses of the Ly-a forest in this medium resolution data set. 
For instance, correlations affect the answer to the question: if the data could be viewed 
under higher resolution, would many of the lines be resolved into multiple systems? Under 
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the hypothesis that all lines are unclustered, one expects only about 4 (of 100) lines to be 
composed of two smaller lines (by random superpositions), because of the extremely low 
line-of-sight density of absorbers. The difference in inferred volumed density of of Lj-a 
absorbers would be only ~ 4% as obtained from high and low resolution observations. 
However, the lines appear to be strongly correlated, and the number of expected unresolved 
pairs (with Av < 230 km/s) is at least as large the total number of resolved pairs with 
230 km/s < Av < 460 km/s - about 10. If the correlation function increases for velocities 
less than 250 km/s, as may be the case for high redshift, higher resolution studies, then the 
number of unresolved pairs may be more than twice as large, or 20 (this would be the case, 
for instance, if the shape of the correlation function were a Gaussian with zero mean and 
scale ~ 250km/s). In the absence of clustering, resolution is expected to make a very small 
difference in the determination of the statistics of the Ly-a forest, e.g. the number density 
and number density evolution. Viewed under high resolution and with strong clustering, 
the number density and evolution of Ly-a forest may easily differ by factors of ~20% from 
their presently determined values. 



6. Discussion and Conclusion 

The two-point correlation function, £, of Ly-a forest lines, (z) = 0.7, is £ = 1.8lj;;|, 
90% confidence level, for separations of 250-500 km/s (see Table [|). 

How does the clustering at low redshifts compare with the recent detections of high 
redshift clustering? Between redshifts of 1.7 and 4, for similar velocity separations and 
equivalent width limits, it seems likely that the correlation function is ~ 0.2 on scales greater 
than 250 km/s (see |Cristiani 1995| and references therein; see [Kernadez-Soto et al. 1996 



for a different view). Additionally, Cristiani's investigation shows some evidence for the 
increase of the correlation function (for velocity separations of 100 km/s) with decreasing 
redshift. Similar findings have also been reported by Srianand (1996). 

In velocity space, the correlation function has strengthened with redshift from £ ~ 0.2 
at high redshifts to £ ~ 1 at the small redshifts considered. The interpretation of the 
increasing correlation function is not straightforward. For instance, the low and high 
redshift absorbers may not be the same (see the discussion in |UAT2| and references therein) . 



How does the detected strength of the Ly-a correlation function compare to that of 
galaxies? Such a comparison is motivated in part by the long-standing idea that Ly-o; 
absorbers are related to the halos of galaxies (e.g. Pahcall 1979| , |Mo 1994]) as well as by the 



finding that at very low redshift, perhaps one third or more of the Ly-a lines are associated 
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with galaxies (|Lanzetta et al. 1995 ). A zeroth-order comparison is made in the second 
panel of Figure 3 by plotting the galaxy-galaxy correlation function (z = 0) against the 
observed Ly-a correlation function. The two correlation functions are consistent within the 
limits of the data. Even the simplest galactic-halo Ly-a forest models have as many as 4 
free parameters: velocity dispersion of galaxies as a function of redshift, evolution of the 
galaxy-galaxy correlation function, fraction of galaxies with Ly-a lines, and the truncation 
point of the spatial two-point correlation function. Therefore, a detailed analysis of the idea 
that galactic halos and Ly-a lines are related will have to wait until greater amounts of data 
are collected and good estimates of both the amplitude and the shape of the correlation 
function become available. 



AU is supported by a NSF Graduate Research Fellowship. I thank J. N. Bahcall, 
J. Miralda-Escude, and H. J. Mo for stimulating and informative discussions, and D. P. 
Schneider for helping to investigate the possibility of false line pairs in the observations. 
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7. Appendix A 

In this appendix, the algorithm for simulating data with a given two-point correlation 
function is briefly described. A further description as well as sample code for implementing 
the algorithm will be provided elsewhere ( Ulmer 1996|) . It was necessary to devise a 



new algorithm, because traditional 3-dimensional algorithms (e.g. [Neyman, fc Scott 
1952| , [Soneira fc Peebles 19771 ) diverge in 1-dimension. 



The algorithm generates a 1-dimensional set of points with two-point correlation 
functions of the form 

£(r) = -1; (r < r min ) 

£( r ) = (r/r )-~<; (r > r min ) 
There are three free parameters in the correlation function: 7, tq and r mm . 



(7) 



The algorithm can be motivated by writing the conditional probability of finding one 
object in the element dr at a distance Ar from another object, 

dP = n(l + £(Ar))dr, (8) 

where n is the density. A first approach to generating a 1-dimensional data set with a 
two-point correlation function could be to (1) start with an object at r = and (2) place 
a second object at T\ = Ari according to a Poisson process with probability of an object 
being placed at Ar (> 0) as determined by Eq. |], (3) place the third object at r 2 = r x + Ar 2 
according to Eq. || that is place the object only with reference to the one directly preceding 
it (4) place the ith object at = ty_i + Ar,, and so on. The simulated data produced by 
this algorithm has some flaws. First, the density of the simulated data is not necessarily n, 
because the probability of placing an object is smaller than ndr when r < r min and greater 
than ndr when r > r min , and they need not cancel out. Second, the slope and scale of the 
two-point correlation in the simulated data are not exactly the same as specified by the 
input parameters, 7, r and r min . 

These problems can be solved by using a slightly different correlation function £ with 
an extra parameter, e, that can be tuned so that the algorithm prodeces the correct density. 
We write this correlation function as 



£(r) = (r/f )-~< - e; (r > r~ in ) ; 

where e is between and 1. The other input parameters, 7, r and rmin, differ slightly from 
7, r and r min . The final algorithm is therefore identical to the one give above expect that 
the probability of finding a point at a given separation, Ar from another is given by 

dP = n(l +i(Ar))dr (10) 
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The algorithm is able to produce data with the desired two-point correlation function 
with high accuracy to over 10 scale lengths in radius for the densities and correlations 
appropriate for this study. 



- 12 - 



REFERENCES 

Bahcall, J.N. 1979, in Longair, M.S. & Warner, J.N. eds., Proc of IAU Coll 54 on "Scientific 
Research with the Space Telescope," NAS CP-2111, p. 215 

Bahcall, J. N., et al. , 1993a, ApJS., 87, 1 (CAT1) 

Bahcall, J. N., et al. , 1996, ApJ., 457, 19 (CAT2) 

Bahcall, J.N., Jannuzi, B.T., Schneider, D.P., Hartig, G.F., Bohlin, R., & Hunkkarinen, V. 
1991, ApJ, 377, L5 

Bahcall, J. N., Jannuzi, B.T., Schneider, D.P, Hartig, G.F., & Green, R.F. 1992, ApJ, 397, 
68 

Bahcall, J. N., Jannuzi, B.T., Schneider, D.P, & Hartig, G.F., 1993b, ApJ, 405, 491 

Bechtold, J., 1987 in Bergeron, J. et al. eds., Proc. Third IAP Workshop, High Redshift 
and Primeval Galaxies, Editions Frontieres, Gif-sur-Yvette, p. 397 

Chernomordik, V.V., 1995, ApJ 440, 431 

Cowie, L.L., Songaila, A., Kim,T.-S., & Hu, E.M., 1995, AJ, 109 1522 



Cristiani, S. 1995, preprint, |astro-ph/9512086| and ESO preprint 1117 

Cristiani, S., D'Odorico, S., Fontana, A., Giallongo, E., Savaglio, S., 1995 MNRAS 273, 
1016 

Fernadez-Soto, A., Lanzetta, L.M., Barcons, X., Carswell, R.F., Webb, J.K., Yahil, A. 1996, 
ApJ, 460, L85 

Heisler, J., Hogan, C. J., & White, S. D. M., 1989, ApJ, 347, 52 

Kulkarni, V.P, Huang, K., Green, R.F., Bechtold, J., Welty, D.E., & York, D.G. 1996, 
MNRAS m press 

Lanzetta, K. M., Bowen, D. V., Tytler, D., & Webb, J. K. 1995, ApJ, 442, 538 

Mo, H.J., 1994, MNRAS 269, P49 

Neyman, J. & Scott, E.L. 1952, ApJ 116, 144 

Ostriker, J.P, Bajtlik, S., & Duncan, R.C. 1988, ApJ, 327, L35 



13 



Peebles, P.J.E., 1980, The Large Scale Structure of the Universe, Princeton University 
Press, Princeton, NJ 

Petitjean, P & Bergeron, J., 1994, A&A, 283, 759 

Rauch, M. & et al. 1992, ApJ 390, 387 

Sarajedini, V.L., Green, R.F., & Jannuzi, B.T. 1995, preprint |astro-ph/9508068 

Sargent, W.L.W., Boksenberg, A., Steidel, C.C., 1988, ApJS 68, 539 

Sargent, W. L. W., Young, P. J., Boksenberg, A., & Tytler, D. 1980, ApJS, 42, 41 

Sargent, W. L. W., Young, P. J., Schneider, D.P 1982, ApJ, 256, 374 

Schneider, D. P. et al. 1993, ApJS, 87, 45 

Schneider, D.P. 1996, private communication 

Soneira, R.M. & Peebles, P.J.E. 1977, ApJ, 211,1 

Sprinrad, H. et al. 1993, AJ 106, 1 

Srianand, R. 1996, ApJ, Submitted. 

Srianand, R. & Khare, P. 1994, MNRAS, 271, 81 

Ulmer, 1996, in preparation 

Webb, J.K., 1987, in Hewett, A., Burbidge, G., Fang, L.Z. eds., Proc. IAU Symp. 124, 
Observational Cosmology, Reidel, Dordrecht, p. 803 

Webb, J.K. & Barcons, X. 1991, MNRAS 250, 270 



This preprint was prepared with the AAS IATgX macros v4.0. 



-14- 



Table 1: Lyman-alpha lines in sample from the first HST Key Project catalog (CAT1). 
Daggers (t) denote extensive metal-line systems. Equivalent widths are given in the rest 
frame. All values are given in Angstroms. 
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Table 2: Lyman- alpha lines in sample from the second HST Key Project catalog (CAT2). 
Daggers (') denote extensive metal-line systems. Equivalent widths are given in the rest 
frame. All values are given in Angstroms. 
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Table 3: Coverage of each observed quasar spectrum (CATl, CAT2) in which the sensitivity 
to meet the uniform detection limit of 0.24 A described in §2 is achieved. Wavelengths are 
given in Angstroms. 
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Table 4: Probability of observed excess in first bin as determined from Monte Carlo 
simulations. The strength of the correlation function on the scale of 250-500 km/s, £, is large. 
The 90% confidence intervals, £ m i n and £ max , are determined by Monte Carlo simulations as 
described in § IV. They are conservative intervals in that they are slightly > 90% confidence 
intervals. 

Prob Metal lines Av or Ar £ £ min £ max 
0.0047 n^ Av L8 0^6 3X~ 

0.0011 yes Av 2.1 0.9 3.5 

0.0013 no Ar 2.0 0.9 3.7 

0.0004 yes Ar 2.2 1.0 3.5 
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Table 5: Ly-a line pairs in the range 250-500 km/s. Pairs with a dagger (t) are those pairs 
which include a metal-line system. Equivalent widths are given in rest frame coordinates. 
All values are given in Angstroms. 
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Fig. 1. — Observed number of pairs as a function of velocity separation (solid lines) show 
a strong excess at the smallest velocity scales. The expected number of pairs in absence 
of clustering (dashed lines) are given for two values of the number density evolution, 
7 = and 1.5 as determined from extensive Monte Carlo simulations. In the top panel, 
metal-line systems are excluded from the sample. 
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Fig. 2. — Same as Figure 1, except the number of pairs are measured as a function of 
comoving separation assuming no peculiar velocities for the absorbers. 
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Fig. 3. — The top panel shows the observed number of pairs (solid line) for the Ly-a lines 
(with metal lines excluded). The upper and lower confidence intervals as determined by 
Monte Carlo experiments are indicated with dashed lines. As described in §4, confidence 
limits apply to the first bin alone. The bottom panel shows the same observed number of 
pairs (solid line) as in the top panel with the galaxy-galaxy correlation function (z = 0) 
shown as a dashed line. The two functions are similar in strength; however, the absorbers 
likely have peculiar velocities, so the two correlation functions are not directly comparable. 
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